Automatically generated media preview

ABSTRACT

Systems, devices, apparatuses, components, methods, and techniques for automatically generating media previews are provided. An example media system for automatically generating media previews for a particular artist include a trailer generation application configured to receive input specifying an artist and duration of a trailer, automatically select clips from two or more media items by the artist, and automatically arrange and combine the clips into a media trailer for later playback.

BACKGROUND

People in the music industry want to promote music-related events topeople who listen to music. A good way to do this is by providingpreviews of an artist's music. However, producing trailers or previewsof a particular artist's music can be time-consuming and costly forpromoters. Promoters need a quick, easy way to provide a preview of anartist's music to a listener to promote an event associated with thatartist.

SUMMARY

In general terms, this disclosure is directed to automaticallygenerating media trailers of an artist's music. In one possibleconfiguration and by non-limiting example, a promoter provides inputspecifying an artist and a duration of a trailer to a server computer,which automatically generates a trailer, or preview, of that artist'smusic by combining multiple clips of that artist's songs into onecohesive media item. Various aspects are described in this disclosure,which include, but are not limited to, the following aspects.

In one aspect, method of generating a media trailer comprises receivinginput from a user specifying one or more artists and a duration of themedia preview. The input is used to automatically select one or morerepresentative media items for the one or more artists and automaticallyselect clips of those representative media items. Then the clips areautomatically arranged and equalized to generate the media trailer.

In another aspect, one or more computer readable media encodeinstructions that cause a computing device to receive a selection of oneor more musical artists, receive an input specifying a duration of anartist trailer, and receive an input of additional media content relatedto an event involving the one or more musical artists. The computingdevice automatically selects one or more representative songs for theone or more musical artists and selects portions of each song to includein the artist trailer. The portions of the songs are automaticallyarranged and equalized. The additional media content is integrated withthe portions of the songs to produce the artist trailer and then apreview of the artist trailer is presented.

In yet another aspect, a system for generating an artist trailercomprises a trailer generation application and a media serverapplication. The trailer generation application is configured to receiveinput from a promoter specifying an artist and a duration for the artisttrailer, automatically select clips from two or more songs by theartist, and automatically arrange and combine the clips into an artisttrailer. The media server application is configured to communicate mediacontent and artist trailers to a consumer. In some embodiments thesystem further comprises a computing device and a media-playback device.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates an example system for automatically generating mediatrailers.

FIG. 2 is a schematic block diagram of the computing device 102 of FIG.1.

FIG. 3 is a schematic block diagram of the media server of FIG. 1.

FIG. 4 is a schematic block diagram of the media-playback device of FIG.1.

FIG. 5 illustrates an example method of automatically generating a mediatrailer performed by some embodiments of the trailer generationapplication of FIG. 3.

FIG. 6 illustrates an example method of receiving input of trailercriteria performed by some embodiments of the trailer generationapplication of FIG. 3.

FIG. 7 illustrates a view of an example trailer generator interfaceimplemented on the computing device of FIG. 1.

FIG. 8 illustrates an example method of automatically selecting clips ofrepresentative media items performed by some embodiments of the trailergeneration application of FIG. 3.

FIG. 9 show an example of media items selected to be included in a mediatrailer according to the method of FIG. 8.

FIG. 10A illustrates example analysis to identify song structureelements performed to select clips of media items according to themethod of FIG. 8.

FIG. 10B illustrates example analysis of song volume performed to selectclips of media items according to the method of FIG. 8.

FIG. 10C illustrates example analysis of song energy performed to selectclips of media items according to the method of FIG. 8.

FIG. 11 illustrates an example method of automatically assemblingselected media clips into a media trailer performed by some embodimentsof the trailer generation application of FIG. 3.

FIG. 12 illustrates the process of arranging media clips according tothe method of FIG. 11.

FIG. 13 illustrates the process of equalizing media clips according tothe method of FIG. 11.

FIG. 14 illustrates the process of cross-fading between media clipsaccording to the method of FIG. 11.

FIG. 15 illustrates an example trailer generation user interface 1300showing a preview of the media trailer on the computing device of FIG.1.

FIG. 16 illustrates an example display of a media trailer on themedia-playback device of FIG. 1.

DETAILED DESCRIPTION

Various embodiments will be described in detail with reference to thedrawings, wherein like reference numerals represent like parts andassemblies throughout the several views. Reference to variousembodiments does not limit the scope of the claims attached hereto.Additionally, any examples set forth in this specification are notintended to be limiting and merely set forth some of the many possibleembodiments for the appended claims.

Trailers or previews are effective tools for providing an audience witha sample of a particular media content item or collection of mediacontent items. In some instances, trailers are used to provide a previewof media content that is going to be released to the public. In otherinstances, trailers serve as a reminder of media content that anaudience might already be familiar with.

Music trailers could be utilized to promote a particular artist or groupof artists. In one example, a trailer could provide a preview of musicthat is going to be released in an upcoming album for a particularartist. In another example, a trailer could remind an audience of themusic that one or more artists have created that might be featured in anupcoming live performance. The trailer could feature more than oneartist if multiple artists are featured in the live performance. Inanother example, a trailer could provide a preview of music by a newartist to attract listeners.

Events related to a particular artist are preferably promoted to anaudience that listens to that particular artist. Multiple platforms areavailable to reach such an audience. For example, a media trailer couldbe played between songs on a music streaming service. In anotherexample, a media trailer could be played on social media platforms suchas Snapchat or Facebook. Many such platforms allow for targetedpromotion of individuals that might be particularly interested inlearning about an event relating to an artist that the individuals enjoylistening to. In some embodiments, the event is a concert orperformance. In other embodiments, the event is the release of a newalbum or the addition of a new artist to a music streaming service.

Promoters want to quickly and easily generate media trailers that aretargeted to particular audience. It is costly and time consuming tocreate customized promotional media previews for every event and everyaudience. In some embodiments, a system automates several steps of thisprocess for promoters. By providing some simple inputs, a promoter canspecify criteria for a customized media trailer that is automaticallygenerated. The promoter can include detailed additional content tocustomize the media trailer for a particular event. In otherembodiments, media trailers are automatically generated with little orno input from a promoter. For example, a new artist may be available ona music streaming service and a media trailer is automatically generatedfrom the artist's music to enable listeners to quickly preview the newmusic.

FIG. 1 illustrates an example system for automatically generating mediatrailers. A promoter P utilizes a computing device 102 to enter inputinto a user interface 118. The input specifies criteria for a mediatrailer that will be used to promote the event 122. The media server 106receives the input through a network 104 and a trailer generationapplication 108 automatically generates a media trailer. The mediatrailer is then communicated through the network 104 to a consumer 110via a media-playback device 112. The media-playback device 112 plays themedia trailer 120 which includes information about the event 122, whichis a Lady Gaga concert in this example.

The network 104 is an electronic communication network that facilitatescommunication between the computing device 102, the media server 106,and the media-playback device 112. An electronic communication networkincludes a set of computing devices and links between the computingdevices. The computing devices in the network use the links to enablecommunication among the computing devices in the network. The networkcan include routers, switches, mobile access points, bridges, hubs,intrusion detection devices, storage devices, standalone server devices,blade server devices, sensors, desktop computers, firewall devices,laptop computers, handheld computers, mobile telephones, vehicularcomputing devices, and other types of computing devices.

In various embodiments, the network 104 includes various types of links.For example, the network 104 can include wired and/or wireless links,including Bluetooth, ultra-wideband (UWB), 802.11, ZigBee, cellular, andother types of wireless links. Furthermore, in various embodiments, thenetwork 104 is implemented at various scales. For example, the network104 can be implemented as one or more local area networks (LANs),metropolitan area networks, subnets, wide area networks (such as theInternet), or can be implemented at another scale. Further, in someembodiments, the network 106 includes multiple networks, which may be ofthe same type or of multiple different types.

FIG. 2 is a schematic block diagram of the computing device 102 ofFIG. 1. The computing device 102 may be used by an individual who wishesto generate a media trailer. The media trailer may be utilized topromote a particular event associated with a particular artist. In someembodiments, a promoter wants to generate a preview of a concertfeaturing more than one artist. In such instances, a trailer may includeclips of songs from multiple artists that are involved in the concert.

The computing device 102 includes an input device 202. The input device202 receives inputs from a user. The input device 202 is one or more ofa keyboard, a touchscreen, a voice recorder, and a mouse. The outputdevice 204 provide audio and visual output. The output device 204 is oneor more of a display, a speaker, headphones, a touchscreen, and otheraudio outputs. In some embodiments, a touch screen operates as both anoutput device and a user input device. In some embodiments, user inputis not required and the media trailer is automatically generated.

In at least some embodiments, the computing device 104 includes aprocessor 208 and a memory device 210. In some embodiments, theprocessor 208 comprises one or more central processing units (CPU). Inother embodiments, the processor 208 additionally or alternativelyincludes one or more digital signal processors, field-programmable gatearrays, or other electronic circuits. The processor 208 executesinstructions stored in the memory device 210.

The memory device 210 operates to store data and instructions. In someembodiments, the memory device 210 stores instructions for a trailergeneration interface 118, and a media-playback engine 216. In someembodiments, the media-playback engine 170 selects and plays back mediacontent and generates interfaces for selecting and playing back mediacontent items.

The trailer generation interface 118 prompts a user to input informationutilized to produce an artist trailer. In some embodiments the user isprompted to enter one or more artists whose songs are to be included inthe trailer. The user may also be prompted to specify a duration of thetrailer. This process is further described with respect to FIG. 6. Thetrailer generation interface 118 also plays previews of media trailersand enables users to edit the trailers. In some embodiments, the trailergeneration interface 118 is not utilized because the media trailer isautomatically generated based on pre-defined criteria. For example, anartist trailer may be automatically generated when a new artist becomesavailable on a music streaming service and the criteria are pre-definedby the music streaming service.

Some embodiments of the memory device also include a media content cache218. The media content cache 218 stores media-content items, such asmedia content items that have been previously received from the mediaserver application 302. The media content items stored in the mediacontent cache 218 may be stored in an encrypted or unencrypted format.The media content cache 218 can also store decryption keys for some orall of the media content items that are stored in an encrypted format.The media content cache 218 can also store metadata about media-contentitems such as title, artist name, album name, length, genre, mood, era,etc. The media content cache 218 can also store playback informationabout the media content items, such as the number of times the user hasrequested to playback the media content item or the current location ofplayback (e.g., when the media content item is an audiobook, podcast, orthe like for which a user may wish to resume playback). In someembodiments, the playlist can be stored in the media content cache 218for later playback.

The memory device 210 typically includes at least some form of computerreadable media. Computer readable media includes any available mediathat can be accessed by the computing device 102. By way of example,computer readable media include computer readable storage media andcomputer readable communication media.

Computer readable storage media includes volatile and nonvolatile,removable and non-removable media implemented in any device configuredto store information such as computer readable instructions, datastructures, program modules, or other data. Computer readable storagemedia includes, but is not limited to, random access memory, read onlymemory, electrically erasable programmable read only memory, flashmemory and other memory technology, compact disc read only memory, blueray discs, digital versatile discs or other optical storage, magneticcassettes, magnetic tape, magnetic disk storage or other magneticstorage devices, or any other medium that can be used to store thedesired information and that can be accessed by the computing device102. In some embodiments, computer readable storage media isnon-transitory computer readable storage media.

Computer readable communication media typically embodies computerreadable instructions, data structures, program modules or other data ina modulated data signal such as a carrier wave or other transportmechanism and includes any information delivery media. The term“modulated data signal” refers to a signal that has one or more of itscharacteristics set or changed in such a manner as to encode informationin the signal. By way of example, computer readable communication mediaincludes wired media such as a wired network or direct-wired connection,and wireless media such as acoustic, radio frequency, infrared, andother wireless media. Combinations of any of the above are also includedwithin the scope of computer readable media.

The network access device 212 operates to communicate with othercomputing devices over one or more networks 104. Examples of the networkaccess device 212 include one or more wired network interfaces andwireless network interfaces. Examples of wireless network interfacesinclude infrared, BLUETOOTH® wireless technology, 802.11a/b/g/n/ac, andcellular or other radio frequency interfaces.

FIG. 3 is a schematic block diagram of the media server of FIG. 1. Themedia server 106 transmits stream media to media-playback devices suchas the media-playback device 112. The media server 106 also generatesmedia trailers based on inputs received from computing devices such asthe computing device 102. In some embodiments, the media server 106includes a media server application 302, a trailer generationapplication 108, a processing device 306, a memory device 1308, and anetwork access device 310. The processing device 306, memory device 308,and network access device 310 may be similar to the processor 208,memory device 210, and network access device 212 respectively, whichhave each been previously described.

In some embodiments, the media server application 302 accesses andstreams music or other audio, video, or other forms of media content.The media server application 302 includes a media stream service 314, amedia data store 316, and a media application interface 318. The mediastream service 314 operates to buffer media content such as mediacontent items 326, 328, and 330, for streaming to a media-playbackdevice 112.

The media application interface 318 can receive requests or othercommunication from media-playback devices or other systems, to retrievemedia content items from the media server 106.

In some embodiments, the media data store 316 stores media content items332, media content metadata 334, and playlists 336. The media data store316 may comprise one or more databases and file systems. The mediacontent items 332 may be audio, video, or any other type of mediacontent, which may be stored in any format for storing media content.

The media content metadata 334 operates to provide various informationassociated with the media content items 332. In some embodiments, themedia content metadata 334 includes one or more of title, artist name,album name, length, genre, mood, era, etc. In some embodiments, themedia content metadata 334 includes characteristics about media contentrelevant to its energy level or popularity. In some embodiments, themedia content metadata 334 records information about one or more users'preferences related to a media content item.

The playlists 336 operate to identify one or more of the media contentitems 332 and assemble the media content items together into a playlist.In some embodiments, the playlists 336 identify a group of the mediacontent items 332 in a particular order. In other embodiments, theplaylists 336 merely identify a group of the media content items 332without specifying a particular order. Some, but not necessarily all, ofthe media content items 332 included in a particular one of theplaylists 336 are associated with a common characteristic such as acommon genre, mood, era, or appropriateness for particular activities.The playlists 336 may include user-created playlists, which may beavailable to a particular user, a group of users, or to the public.

The trailer generation application 108 communicates with the computingdevice 102 and the media-playback device 112 to present automaticallygenerated media trailers. In some embodiments, criteria for the mediatrailer is received from the computing device 102. In other embodiments,the criteria is pre-defined and stored at the media server. In someembodiments, the criteria specifies one or more musical artists and aduration of a trailer or media preview. Based on this criteria, themedia item selection engine 340 selects two or more media content itemsto include in the media trailer. The media item selection engine 340utilizes media content metadata 334 to select media content items 332from the media data store 316. After the media content items 332 havebeen selected, the clip selection engine 342 analyzes the media contentitems 332. Portions of the media content items 332 are selected to beincluded in the artist trailer.

Once the clips have been selected, the arrangement engine 344automatically puts the clips in order and creates transitions betweenthe clips. The order of the clips may be determined by one or more ofenergy, tempo, and popularity. The transitions are created by fading inand out as well as optionally matching beats. The equalizer 346normalizes the volume and levels of the media content to match themtogether.

In some embodiments, the visual effects engine 348 receives visualcontent from the computing device 102 and adds the visual content to themedia trailer. The visual content may be one or more of an image and avideo. For example, to promote a concert, the visual content may be aconcert poster image. In some embodiments, the audio effects engine 350receives audio content from the computing device 102 and adds the audiocontent to the artist trailer. The audio content is typically spoken orverbal content. The spoken content may describe an event associated withone or more artists such as a concert or an album release. In someembodiments, the spoken content is recorded by the same artist that isbeing promoted by the trailer.

FIG. 4 is a schematic block diagram of the media-playback device 112 ofFIG. 1. In some embodiments, the media-playback device 112 includes atouch screen 402, a content output device 404, a processing device 406,a memory device 408, a location-determining device 410, and a networkaccess device 412.

The touch screen 402 operates to receive inputs from a selector (e.g., afinger, stylus, etc.) controlled by a user such as the consumer C. Insome embodiments, the touch screen 402 operates as both a display deviceand a user input device. In some embodiments, the touch screen 402detects inputs based on one or both of touches and near-touches. In someembodiments, single taps and/or swipes are recognized as inputs. In someembodiments, the touch screen 402 displays a user interface 414 forinteracting with the media-playback device 112. Some embodiments do notinclude a touch screen 402. Some embodiments include a display deviceand one or more separate user interface devices. Further, someembodiments do not include a display device.

The content output device 404 operates to output media content. In someembodiments, the content output device 404 generates media output 114for the consumer C. Examples of the content output device 404 include aspeaker assembly comprising one or more speakers, an audio output jack,a Bluetooth transmitter, a display panel, and a video output jack. Otherembodiments are possible as well. For example, the content output device404 may transmit a signal through the audio output jack or Bluetoothtransmitter that can be used to reproduce an audio signal by a connectedor paired device such as headphones, speaker system, or vehiclemedia-playback system.

In some embodiments, the processing device 406 comprises one or morecentral processing units (CPU). In other embodiments, the processingdevice 406 additionally or alternatively includes one or more digitalsignal processors, field-programmable gate arrays, or other electroniccircuits.

The processing device 406, memory device 408, and network access device412 may be similar to the processor 208, memory device 210, and networkaccess device 212 respectively, which have each been previouslydescribed.

The memory device 408 includes a media-playback engine 408 and a mediacontent cache 420. The media-playback engine 408 and the media contentcache 420 may be similar to the media-playback engine 216 and the mediacontent cache 218, which have each been previously described. In someembodiments, the media-playback engine 418 receives and plays mediacontent received from the media server 106. The media content includesartist trailers generated by the media server 106. In some embodiments,artist trailers are played between songs or other media content that arestreamed from the media server 105 to the media-playback engine 418.

The location-determining device 410 is a device that determines thelocation of the media-playback device 112. In some embodiments, thelocation-determining device 410 uses one or more of the followingtechnologies: Global Positioning System (GPS) technology, cellulartriangulation technology, network-based location identificationtechnology, Wi-Fi positioning systems technology, and combinationsthereof.

The network access device 412 may be similar to the network accessdevice 212, which has been previously described.

Each of the computing device 104, media server 106, and media-playbackdevice 112 can include additional physical computer or hardwareresources. In at least some embodiments, the media-playback device 112and computing device 104 communicates with the media server 106 via thenetwork 104

Although in FIG. 2 only a single media-playback device 112, a singlecomputing device 102, and a single media server 106 are shown, inaccordance with some embodiments, the media server 106 can support thesimultaneous use of multiple media-playback devices and computingdevices. The media-playback device and computing device cansimultaneously access media content from multiple media servers.

FIG. 5 illustrates an example method 500 of automatically generating amedia trailer performed by some embodiments of the trailer generationapplication 108. Such a method can be used, for example, to build amedia trailer to promote a particular event associated with an artist. Apromoter may wish to promote an upcoming tour for a particular artist bypresenting previews of popular songs by the artist to consumers thatlike to listen to streamed music. However, the promoter may not have thetime or ability to put together an effective preview of the artist'smusic. The promoter may be using a computing device, such as thecomputing device 102 of FIG. 1 to input criteria for the media trailer.The criteria is communicated to a media server such as the media server106 in FIG. 1 to automatically generate a media trailer for the artist.The promoter may desire that the media server 106 communicate theautomatically generated artist trailer to consumers listening to mediacontent on media-playback devices such as the media-playback device 112of FIG. 1. In some embodiments, some of the operations of the method 500are automated and do not require input from a user to generate the mediatrailer.

At operation 502, the media server 106 receives inputs from thecomputing device 102 specifying one or more artists to include in themedia trailer. Input specifying a duration for the trailer is alsoreceived. The duration determines the amount of media content that canbe included in the program. FIG. 6 further describes receiving inputfrom a computing device specifying criteria for a media trailer.Alternatively, in some embodiments, the artists and duration areautomatically selected based on pre-defined criteria stored at the mediaserver 106.

At operation 504, the media server 106 automatically selects clips ofrepresentative media items to include in the media trailer. Media itemsinclude one or more of songs, podcast clips, portions of comedy shows,portions of audiobooks, and other media items. Multiple factors may beconsidered for choosing appropriate media items and which clips toselect from those media items. FIG. 8 further describes the clipselection process.

At operation 506, the selected media clips are automatically assembledarranged and combined. This process is further described with respect toFIG. 11.

At operation 508, the media server 106 optionally receives additionalcontent and integrates the content with the media clips to generate amedia trailer. In some embodiments, the additional content is visualcontent such as an image or a video. In some embodiments, the additionalcontent is audio content such as spoken dialogue. The visual contentand/or audio content may describe a particular event associated with theone or more artists featured in the media trailer.

At operation 510, the media trailer is presented to the user. In someembodiments, the user is a promoter P using a computing device 102 torequest generation of an artist trailer. In other embodiments, the mediatrailer is not presented to the user and is instead saved directly tothe media server 106. In at least some embodiments, the media preview ispresented to the user via an output device 204 on the computing device104 such as speakers or headphones. The promoter P can review the mediatrailer and make changes. Upon approval, the media trailer is submittedto the media server 106 where it may be accessed by media-playbackdevices such as the media-playback device 112 shown in FIG. 1.

FIG. 6 illustrates an example method 502 of receiving input specifyingcriteria for a media trailer performed by some embodiments of thetrailer generation application 108. The criteria is used toautomatically generate a media trailer at the media server 106 based oninput received from a user.

At operation 602, input is received from a computing device 104specifying one or more artists. In at least some embodiments, the inputis received through a user interface, such as the trailer generator userinterface 700 depicted in FIG. 7. The artist selection dictates whichmedia items will be included in the media trailer. In some embodiments,a musical artist is selected and songs by that musical artist areselected for inclusion in the media trailer. In other embodiments,multiple artists are selected because they are performing together at anevent and the media preview will include songs by those artists. Inother embodiments, the artist is a comedian, a spoken word actor, anaudiobook author, a podcast host, or other artist.

At operation 604, input is received from the computing device 102specifying a duration for the media trailer. In at least someembodiments, the input is received through a user interface, such as thetrailer generator user interface 700 depicted in FIG. 7. In someembodiments, the duration is defined as a length of time. For example,in some embodiments, the duration of the media trailer is from 5 to 180seconds, 10 to 150 seconds, 15 to 120 seconds, 20 to 90 seconds, or 30to 60 seconds. In other embodiments, the duration is defined by a numberof media items to be included in the media trailer. For example, in someembodiments, the media trailer is from 2 to 10 media items, 3 to 8 mediaitems, 4 to 6 media items, or 5 media items in length.

At operation 606, input is optionally received from the computing device102 specifying an audience for the media trailer. In some embodiments,the user of the computing device 104 can select to create a mediatrailer where the media content is selected for a general group or atarget individual. In at least some embodiments, the input is receivedthrough a user interface, such as the trailer generator user interface700 depicted in FIG. 7.

At operation 608, additional content is optionally received from thecomputing device 102. In at least some embodiments, the additionalcontent is submitted to the media server 106 through the trailergenerator user interface 700 depicted in FIG. 7. In at least someembodiments, the additional content is one or more of visual content andaudio content. The visual content may be an image or a video. The audiocontent may be verbal audio content. In some embodiments, the additionalcontent provides details for an event related to the one or more artistsselected in operation 602.

FIG. 7 illustrates an example trailer generator user interface 700 thatis used in some embodiments to receive input from a promoter P at acomputing device 102. The trailer generator user interface 700 ispreferably a visual interface that is displayed on a computing devicesuch as the computing device 102 depicted in FIG. 1. In someembodiments, the trailer generator user interface 700 includes a trailertitle field 702, an artist field 704, a trailer duration selector 706,an audience selector 708, and a content upload field 710. Not all fieldsare included in all embodiments. The title field 702, audience selector708, and content upload field 710 are optional.

In the example shown in FIG. 7, a promoter has entered a title “JoanneWorld Tour” into the trailer title field 702. The trailer title field702 serves to identify the media trailer for the convenience of thepromoter. In some embodiments, the promoter may create multiple mediatrailers for the same or different events that involve the same ordifferent artists. Titles can be used to describe and distinguish themedia trailers.

In the example shown in FIG. 7, a promoter has entered the artist “LadyGaga” into the artist field 704. This input indicates that songs by theartist Lady Gaga should be included in the media trailer to promote herworld tour. In some embodiments, the promoter can or enter two or moreartists so that songs by the two or more artists would be included inthe media trailer.

In the example shown in FIG. 7, a promoter has selected “30 seconds” atthe trailer duration selector 706. In this example, a pull down menu isprovided for the promoter to select from pre-determined choices forduration. In other embodiments, the promoter could enter any value fortime in the trailer duration selector 706. For example, the mediatrailer could have a duration of from 5 to 180 seconds or from 30 to 60seconds. In some embodiments, the duration is determined by selecting anumber of songs or media content items to include in the media trailer.

The promoter P can optionally select an audience for the media previewby utilizing the audience selector 708. In the example of FIG. 7, radiobuttons 709 are provided to allow the promoter to select either“individually targeted” or “general population” for the audience. Otherselectors could be used such as a pull-down menu. Other options foraudiences may be possible, such as a smaller subset of the generalpopulation limited by geography or age or targeting people who listen tothe artist.

The promoter can optionally upload additional content to add to themedia trailer. The content upload field 710 can be populated byselecting a file from the computing device 104. Here, the promoter wouldselect the browse button 712 to open a window to select a file toupload. The file title or location would then appear in the contentupload field 710. The additional content could be an audio file, animage file, or a video file. In some embodiments, the promoter canupload two or more additional content files.

Once the promoter 102 has input all of the desired criteria andinformation for the media trailer, the promoter selects the save button714 on the trailer generator user interface 700. The trailer generatoruser interface 700 is just one example of how a promoter or other usermight input criteria for generating a media preview or media trailer.Other alternatives and variations are possible.

FIG. 8 illustrates an example method 504 of automatically selectingclips of representative media items to include in a media trailer, aswas described in FIG. 5. This method determines which media items toinclude in the media trailer as well as which clips of the media itemsto include in the media trailer.

At operation 802, the audience for the media trailer is determined. Fortrailers targeting individuals, the method proceeds to operation 806.For trailers targeting a general population, the method proceeds tooperation 804. At operation 804, the average music taste of the generalpopulation is determined. The popularity of the media items on averageis used to select the media items to include in the media preview atoperation 812.

At operation 806, for media previews targeted at individuals, it isdetermined whether the targeted individual has played a song by theselected artist before. If the individual has played a song by theartist previously, the method proceeds to operation 810. If theindividual has not played a song by the artist before, the methodproceeds to operation 808. At operation 808, an example 40 dimensionalmethod is applied to determine which media items are best suited for theindividual. The 40 dimensional method utilizes comparisons between auser's vector and vectors of individual music tracks. A user's vector iscreated by averaging the vectors of tracks that the user has listenedto, inversely weighted by recency. Vectors are obtained, for example,from an algorithm described in U.S. Pat. No. 9,547,679. Highersimilarity values between the user's vector and a given track's vectorindicate that the user is more likely to like the track. Othertechniques or other quantities of vectors can be used in otherembodiments.

At operation 810, affinity scoring is used to determine which mediaitems are best suited for the individual. The affinity score is ameasure of how much a user like a track that the user has listened tobefore. The score is based on the number of times the user has playedthe track, how long ago it was played, how often the track was played bythe user, and how often the user has skipped playing the track. Higheraffinity scores indicate that the user is more likely to like a song.Additional details about affinity scores are described in U.S. Pat. No.9,547,679.

At operation 812, one of 40 dimensional scoring from operation 808,affinity scoring from operation 810, and average tastes from operation804 are used to select the media items (songs) to include in the mediatrailer for the selected artist(s). Once the songs have beenautomatically selected, the method proceeds to operation 814.

At operation 814, the selected songs are analyzed for various aspects oftheir content including repetition, volume, energy, uniqueness, sectionboundaries, drops, beats, bars, and voice presence. This analysisidentifies portions of the songs that are most recognizable to listenerssuch as a chorus, a hook, or another distinctive portion of the song.

Once the songs have been analyzed, clips or portions of the songs areautomatically selected. If the media item is a portion of a comedy show,for example, the most recognizable portion may be a punchline of a joke.If the media item is a song, the most recognizable portion may beselected such as a chorus of the song. The clips have an automaticallydefined duration. The length of the clips is determined based on theoverall length of the media preview and the number of media items thatare included. For example, if the media preview is 30 seconds long and 5songs are included, each clip will have an average length of 6 seconds.

FIG. 9 illustrates an example of song selections made according tooperation 812 of the method of FIG. 8. In this example, Lady Gaga is thechosen artist and the audience is individually targeted. FIG. 9 showsthat four songs (902, 904, 906, 908) have been selected to fill the 30second duration of the media trailer. The songs shown were selected foran individual who has played Lady Gaga songs previously, so affinityscoring was used to select the songs. The affinity score 910 isdisplayed next to the song title in FIG. 9. The song 904 “PerfectIllusion” has an affinity score of 3, while the song 906 “Poker Face”has an affinity score of 11. This indicates that the user likes “PokerFace” more than “Perfect Illusion” based on listening behavior of theuser. However, the scores of these songs are higher than other Lady Gagasongs, so lower scoring songs are not included. For example, a song withan affinity score of 2 or less would not be included in this particularmedia trailer.

FIGS. 10A-C illustrate the analysis and selection of clips from theselected media items to be included in the media preview according tothe method of FIG. 8. Various methods of analyzing media items can beemployed to identify distinctive portions of the media items. The clipscan vary in duration depending on the overall duration of the mediatrailer and the number of media items included in the media trailer.Each clip could be from 2 to 30 seconds in duration or from 5 to 10seconds in duration. Other durations are possible.

In FIG. 10A, the song 1002 has been analyzed to identify repetitive andhigher volume portions, which here are the chorus 1004 of the song 1002.The clip 1006 is then selected from one of the chorus 1004 portions ofthe song. Verbal recognition can be employed to identify the beginningof a chorus or verse.

In FIG. 10B, the song 1012 has been analyzed to find a particularlydistinctive portion based on volume and energy. Volume is represented inFIG. 10B by a waveform where larger peaks represent portions of the song1012 with greater energy and volume. Here, the song is an electronicmusic song that does not have a lot of verbal content. The “drop,” orbig burst of energy following a lull in the song, has been captured inthe selected clip 1016.

In FIG. 10C, the song 1022 has been analyzed for energy, which is againrepresented as a waveform. Energy may be a useful metric for classicalmusic that does not have lyrics. The climax of the song is included inthe selected clip 1026.

FIG. 11 illustrates an example method 506 of automatically assemblingselected media clips into a media trailer. The clips are put together tocreate a cohesive media presentation.

At operation 1102, the selected clips are arranged. The clips are putinto a particular order based on various criteria. Such criteria couldinclude popularity, energy, tempo, and mood.

At operation 1104, the selected clips are equalized to match the levelsof the clips. Equalizing can include matching the volume, timbre, andsound quality of the media clips. Equalizing the clips makes them blendtogether and create smoother transitions between the clips, avoidingabrupt changes.

At operation 1106, the selected clips are blended together bycross-fading and/or beat matching. Cross-fading reduces the volume of afirst clip and increases the volume of a second clip at the same timesuch that two clips overlap slightly. Cross-fading smooths thetransitions between clips. Additionally, beat matching can be utilizedto smooth transitions between clips. By matching beats, hiccups in theplayback of the music can be avoided.

FIG. 12 illustrates the process 1102 of arranging media clips accordingto the method of FIG. 11. Four clips (1202, 1204, 1206, 1208) arearranged into a particular order for the media trailer. As was describedwith respect to operation 1102 of FIG. 11, various criteria can beconsidered for choosing an order of clips such as energy, tempo, andpopularity. Here, clip 1208 is first followed by 1202, 1206, and finally1204.

FIG. 13 illustrates the process 1104 of equalizing media clips accordingto the method of FIG. 11. The four clips (1202, 1204, 1206, 1208) areequalized. In this example, the waveforms represent volume. Otheraspects of the clips can be equalized such as timbre and pitch. Theclips initially have a large variation in volume, as shown by thewaveforms. After equalization, the volumes of the clips are much moreeven.

FIG. 14 illustrates the process of cross-fading between media clipsaccording to the method of FIG. 11. Cross-fading is provided between thefour clips (1202, 1204, 1206, 1208) to blend them together. The clipsoverlap slightly as one clip is faded out and the next clip is faded in.For example, clip 1208 fades out as clip 1202 fades in at the crossfade1242. Other blending and smoothing operations can be utilized such asbeat-matching, echo, sound effect matching, direct cut, and spinback.

FIG. 15 illustrates an example trailer generation user interface 700showing a preview of the media trailer on the computing device 104. Amedia player window 1302 is displayed showing the visual content 1304that was added to the media preview. In this example, a Lady Gaga tourposter image is included with additional text information provided abouta particular concert date and location. Navigation controls 1306 for themedia player are provided to manipulate playback of the media preview. Auser can select the “edit” button 1308 to modify the media trailer orselect the “submit” button 1310 to approve of the media trailer and sendit back to the media system 106 for distribution.

The completed media trailer is communicated to the media server 106through a network 104 for storage in the media data store 316. The mediatrailers 352 are stored with the media content items 332. As consumers,such as consumer C, listen to media through media-playback devices 112,media content is streamed by the media stream service 314. The mediastream service 314 accesses media content items 332, including mediatrailers 352, and communicates the media content items 332 to theconsumers.

FIG. 16 illustrates an example of a display 1600 of a media trailer on amedia-playback device 112. The visual content 1304 is displayed on themedia-playback device 112 as the media output 114 of the media traileris being played. The consumer C can navigate playback using the controls1604 on the touch screen. A consumer may hear a media trailer betweensongs on an ad-based music streaming service.

The various embodiments described above are provided by way ofillustration only and should not be construed to limit the claimsattached hereto. Those skilled in the art will readily recognize variousmodifications and changes that may be made without following the exampleembodiments and applications illustrated and described herein, andwithout departing from the true spirit and scope of the followingclaims.

1. A computer-implemented method of automatically generating acustomized media trailer comprising: receiving input specifying one ormore artists; receiving input specifying a duration of the mediatrailer; receiving input specifying an audience for the media trailer;analyzing media consumption history of the specified audience;automatically selecting two or more representative media items for theone or more artists based on the media consumption history of thespecified audience; automatically selecting a clip of each of therepresentative media items; automatically arranging and equalizing theclips of the representative media items to generate the media trailerhaving the specified duration; and streaming the media trailer to one ormore media-playback devices.
 2. The method of claim 1, wherein theinputs for artists, duration, and audience are received automaticallybased on pre-defined criteria received from a media server. 3.(canceled)
 4. The method of claim 3, wherein the duration is a length oftime from 5 to 180 seconds.
 5. The method of claim 3, wherein theduration is a length of time from 30 to 60 seconds.
 6. (canceled)
 7. Themethod of claim 1, wherein the media items are one or more of songs,podcast clips, portions of comedy shows, and portions of audiobooks. 8.The method of claim 1, wherein the inputs for artists, duration, andaudience are received from a user computing device.
 9. (canceled) 10.(canceled)
 11. The method of claim 1, wherein the audience is a generalpopulation of users and the one or more representative media items areselected by the average music tastes of a general population.
 12. Themethod of claim 1, wherein the audience is an individual that haspreviously played media items by the artist and the one or morerepresentative media items are selected by affinity score, wherein theaffinity score measures how much the individual likes a media item. 13.The method of claim 1, wherein the audience is an individual that hasnot previously played media items by the artist and the one or morerepresentative media items are selected by 40 dimensional score.
 14. Themethod of claim 1, wherein the clip of each representative media item isselected by one or more of the following: repetition, volume, energy,uniqueness, and voice presence.
 15. The method of claim 1, wherein theduration of the clip of each representative media item is automaticallydetermined based on the overall duration and number of media itemsincluded in the media preview.
 16. The method of claim 1, wherein theclip of each representative media item is from 2 to 30 seconds induration.
 17. The method of claim 1, wherein the clip of eachrepresentative media item is from 5 to 10 seconds in duration.
 18. Themethod of claim 1, wherein the clips of the representative media itemsare arranged based on one or more of energy, tempo, popularity, andtimbre.
 19. The method of claim 1, wherein the clips of therepresentative media items are equalized to match beats, volume, soundquality, and style.
 20. The method of claim 1, further comprisingautomatically providing cross-fading between the clips of therepresentative media items.
 21. The method of claim 1, furthercomprising receiving additional content to add to the media preview. 22.The method of claim 21, wherein the additional content is visual contentcomprising an image or a video.
 23. The method of claim 21, wherein theadditional content is spoken content describing an event related to theone or more artists.
 24. One or more computer readable media encodinginstructions embodied thereon that, when executed by one or morecomputing devices, cause the computing device to perform a method ofautomatically generating a customized artist trailer: receive aselection of one or more musical artists; receive an input specifying aduration of an artist trailer; receive an input of an audience for theartist trailer, wherein the audience is one of general population andtarget individual; analyze media consumption history of the audience;receive an input of additional media content related to an eventinvolving the one or more musical artists, the additional media contentcomprising one or more of images, videos, and spoken content;automatically select one or more representative songs for the one ormore musical artists based on the media consumption history of theaudience; automatically select a portion of each song; automaticallyarrange and equalize the portions of the songs; integrate the additionalmedia content with the portions of the songs to produce the artisttrailer; present a preview of the artist trailer on an output device ofthe computing device; and save the artist trailer to a media server. 25.The computer readable media of claim 24, wherein the event is one of analbum release, a concert, a tour, introduction of a new artist, and apublic appearance.
 26. The computer readable media of claim 24, whereinthe additional media content is one or more of a video clip of aconcert, a clip of a music video, an album cover art image, a tourposter image, an image of the artist, and an image related to the event.27. The computer readable media of claim 24, wherein the additionalmedia content is a vocal recording providing spoken information aboutthe event.
 28. A system for automatically generating a customized artisttrailer comprising a media server comprising: a trailer generationapplication configured to: receive input from a computing devicespecifying an artist and a duration for the artist trailer; receiveinput specifying an audience for the artist trailer; analyze mediaconsumption history of the specified audience; automatically selectclips from two or more songs by the artist based on the mediaconsumption history of the specified audience; and automatically arrangeand combine the clips into an artist trailer; and a media streamingservice configured to stream media content and artist trailers to one ormore media-playback devices.
 29. The system of claim 28, wherein thecomputing device comprises: a user interface; a processor; and memoryencoding instructions that, when executed by the processor, cause thecomputing device to: prompt a promoter to input an artist and a durationfor the artist trailer; communicate the input to the media server; andreceive a preview of the artist trailer.
 30. The system of claim 29,wherein the media-playback device comprises: a processing device; andmemory encoding instructions that, when executed by the processor, causethe computing device to receive and play media content received from themedia server, wherein the media content includes artist trailers. 31.The system of claim 28, wherein the audience is an individual target andthe artist trailer is customized to the individual using a 40dimensional scoring.
 32. The system of claim 28, wherein the audience isan individual target and the artist trailer is customized to theindividual using affinity scoring.
 33. The system of claim 28, whereinthe audience is a general population and the artist trailer iscustomized based on the average tastes of the general population. 34.The method of claim 1, wherein streaming the media trailer to one ormore media-playback devices comprises streaming the media trailerbetween two media items.